a, b, During all auditory and visual task blocks, low (lowest frequency 200 Hz), medium (561 Hz), and high (1573 Hz) pitched tones were presented with six equidistant pitch levels in each category (pitch range within a category depended on subjective pitch discrimination performance during training). The tones were 200 ms in duration and consisted of two successive 100 ms parts (each part included a 5 ms linear onset and offset ramps). In pitch discrimination tasks (a), the last half of each tone was slightly lower or higher in pitch than the first part (the magnitude of pitch difference depended on the difficulty level). Subjects were required to press a button when the two halves of a tone had the same pitch (target). In the pitch memory tasks (b), the two halves of a tone always had the same pitch and subjects were required to respond when a tone belonged to the same pitch category as the one presented one, two, or three trials before (target in a 2-back task is illustrated). c–h, The task and its difficulty level were indicated by task instruction symbols presented on a screen from 6 s before each block onset until the end of the block. A letter “Λ” (Lambda) or “V” (not shown in the figure) in the middle of task instruction symbols indicated the task modality, auditory or visual, respectively. Pitch discrimination tasks were indicated by one red dot (c–e), while the pitch memory tasks were indicated by two red dots (f–h). In the pitch discrimination tasks, task difficulty level was indicated by the position (yellow rectangles) of the red dot (the leftmost position, easy; second position from the left, medium; rightmost position, hard). For the pitch memory tasks, the distance between two red dots indicated the relative serial positions of the sounds to be compared. For the visual tasks, in addition to the letter V, two red dots were presented at second and third position from the left.